EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching
نویسندگان
چکیده
Recently convolutional neural network (CNN) promotes the development of stereo matching greatly. Especially those end-to-end stereo methods achieve best performance. However less attention is paid on encoding context information, simplifying two-stage disparity learning pipeline and improving details in disparity maps. Differently we focus on these problems. Firstly, we propose an one-stage context pyramid based residual pyramid network (CP-RPN) for disparity estimation, in which a context pyramid is embedded to encode multi-scale context clues explicitly. Next, we design a CNN based multi-task learning network called EdgeStereo to recover missing details in disparity maps, utilizing mid-level features from edge detection task. In EdgeStereo, CP-RPN is integrated with a proposed edge detector HEDβ based on two-fold multitask interactions. The end-to-end EdgeStereo outputs the edge map and disparity map directly from a stereo pair without any post-processing or regularization. We discover that edge detection task and stereo matching task can help each other in our EdgeStereo framework. Comprehensive experiments on stereo benchmarks such as Scene Flow and KITTI 2015 show that our method achieves state-of-the-art performance.
منابع مشابه
Pyramid Stereo Matching Network
Recent work has shown that depth estimation from a stereo pair of images can be formulated as a supervised learning task to be resolved with convolutional neural networks (CNNs). However, current architectures rely on patch-based Siamese networks, lacking the means to exploit context information for finding correspondence in illposed regions. To tackle this problem, we propose PSMNet, a pyramid...
متن کاملReal-time Line Detection and Line-based Motion Stereo
Recognition of shape is one of the fundamental problems in computer vision. A number of fast line detection and line-based depth recovery algorithms have been developed on various parallel architectures to meet the requirement of real-time robotic vision. This thesis describes a parallel and hierarchical (pyramidal) approach to fast Hough line detection and line-based motion stereo. The lines a...
متن کاملFast global stereo matching via energy pyramid minimization
We define a global matching framework based on energy pyramid, the Global Matching via Energy Pyramid (GM-EP) algorithm, which estimates the disparity map from a single stereo-pair by solving an energy minimization problem. We efficiently address this minimization by globally optimizing a coarse to fine sequence of sparse Conditional Random Fields (CRF) directly defined on the energy. This glob...
متن کاملStereo Vision and 3D Reconstruction on a Distributed Memory System
An important research topic in image processing is stereo vision. The objective is to compute a 3-dimensional representation of some scenery from two 2-dimensional digital images. Constructing a 3-dimensional representation involves finding pairs of pixels from the two images which correspond to the same point in space. Several stereo matching algorithms are developed to find matching pairs. Hi...
متن کاملAn Automated Approach to Stereo Matching Seasat Imagery
This paper describes a procedure developed at University College London for the automatic stereo matching of SAR imagery from NASA's Seasat satellite. The method employed uses Gruen's least squares correlation technique to improve the match accuracy of randomly generated points as they are cascaded down an image pyramid, coupled with a sheet growing mechanism in order to produce a dense array o...
متن کامل